Large-Scale Image Mining with Flickr Groups

نویسندگان

  • Alexandru-Lucian Gînsca
  • Adrian Popescu
  • Hervé Le Borgne
  • Nicolas Ballas
  • Dinh-Phong Vo
  • Ioannis Kanellos
چکیده

The availability of large annotated visual resources, such as ImageNet, recently led to important advances in image mining tasks. However, the manual annotation of such resources is cumbersome. Exploiting Web datasets as a substitute or complement is an interesting but challenging alternative. The main problems to solve are the choice of the initial dataset and the noisy character of Web text-image associations. This article presents an approach which first leverages Flickr groups to automatically build a comprehensive visual resource and then exploits it for image retrieval. Flickr groups are an interesting candidate dataset because they cover a wide range of user interests. To reduce initial noise, we introduce innovative and scalable image reranking methods. Then, we learn individual visual models for 38,500 groups using a low-level image representation. We exploit off-the-shelf linear models to ensure scalability of the learning and prediction steps. Finally, Semfeat image descriptions are obtained by concatenating prediction scores of individual models and by retaining only the most salient responses. To provide a comparison with a manually created resource, a similar pipeline is applied to ImageNet. Experimental validation is conducted on the ImageCLEF Wikipedia Retrieval 2010 benchmark, showing competitive results that demonstrate the validity of our approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving the Fisher Kernel for Large-Scale Image Classification

The Fisher kernel (FK) is a generic framework which combines the benefits of generative and discriminative approaches. In the context of image classification the FK was shown to extend the popular bag-of-visual-words (BOV) by going beyond count statistics. However, in practice, this enriched representation has not yet shown its superiority over the BOV. In the first part we show that with sever...

متن کامل

JustClick: Personalized Image Recommendation via Exploratory Search from Large-Scale Flickr Image Collections

In this paper, we have developed a novel framework called JustClick to enable personalized image recommendation via exploratory search from large-scale collections of manuallyannotated Flickr images. First, a topic network is automatically generated to summarize large-scale collections of manuallyannotated Flickr images at a semantic level. Hyperbolic visualization is further used to enable int...

متن کامل

Internet Multimedia Search and Mining

We present in this chapter a review of current work that leverages on large online social networks’ meta-information, in particular Flickr Groups. We briefly present this hugely successful feature in Flickr and discuss the various ways in which metadata stemming from users’ interactions with and within groups has been exploited by researchers to improve on state-of-the-art search and browsing a...

متن کامل

Developing metrics to characterize Flickr groups

Flickr, the large-scale online photo sharing website, is often viewed as one of the ‘classic’ examples of Web2.0 applications through which researchers are able to observe the social behavior of online communities. One of the main features of Flickr is groups. These provide a means to organize, share and discuss photos of potential interest to group members.This paper explores the scale of grou...

متن کامل

Knowledge Discovery from Community-Contributed Multimedia

T he prevalence of imageand videocapturing devices and the advent of media-sharing services such as Flickr and YouTube have drastically increased the volume of community-contributed multimedia. For example, there are reportedly more than four billion images in Flickr and 24 hours of new videos are uploaded to YouTube every minute. Such a vast amount of photos, videos, and music shared via websi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015